Examining characteristics of predictive models with imbalanced big data
نویسندگان
چکیده
منابع مشابه
Predictive Data Mining for Highly Imbalanced Classification
The paper addresses some theoretical and practical aspects of data mining, focusing on predictive data mining, where two central types of prediction problems are discussed: classification and regression. Further accent is made on predictive data mining, where the time-stamped data greatly increase the dimensions and complexity of problem solving. The main goal is through processing of data (rec...
متن کاملAn insight into imbalanced Big Data classification: outcomes and challenges
BigData applications are emerging during the last years, and researchers frommany disciplines are aware of the high advantages related to the knowledge extraction from this type of problem. However, traditional learning approaches cannot be directly applied due to scalability issues. To overcome this issue, the MapReduce framework has arisen as a “de facto” solution. Basically, it carries out a...
متن کاملExamining Predictive Accuracy Among Discounting Models
Both descriptive and normative arguments claim that the discount rate to be applied to public projects should be elicited from individual intertemporal preferences. We present a methodology to analyze data from experimental surveys on intertemporal preferences. Focusing on the exponential and the hyperbolic discounting models, we model the experimental data published by Thaler (1981) by means o...
متن کاملUnlocking the Value of Healthcare’s Big Data with Predictive Analytics
Background The volume of electronic data in the healthcare industry continues to grow. Adoption of electronic solutions and increased connectivity are driving higher levels of information capture than ever before. Storing data, however, holds little value unless it can be accessed, analyzed and put into action. Such is the quandary of today’s healthcare environment, where there is a massive vol...
متن کاملBuilding Useful Models from Imbalanced Data with Sampling and Boosting
Building useful classification models can be a challenging endeavor, especially when training data is imbalanced. Class imbalance presents a problem when traditional classification algorithms are applied. These algorithms often attempt to build models with the goal of maximizing overall classification accuracy. While such a model may be very accurate, it is often not very useful. Consider the d...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Big Data
سال: 2019
ISSN: 2196-1115
DOI: 10.1186/s40537-019-0231-2